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I^.  Introduction 

Estimation  problems,  and  filtering  among  them,  are  basically  concerned 
c  with  extracting  the  best  Information  from  Inaccurate  observation  of  signals. 

Perhaps  the  earliest  roots  of  this  type  of  prcolems  go  back  to  the  least 

fl- 

squares  estimation  at  the  time  of  Galileo  Galilei  In  1632  and  Gauss  In  1795. 

The  relatively  modern  and  more  general  development  of  least-squares  estimation 
In  stochastic  processes  Is  marked  by  the  work  of  A.N.  Kolmogorov  and  N.  Wiener 
In  the  1940's.  Most  recently,  and  due  to  vast  research  and  development  of  the 
space  age,  the  estimation  theory  experienced  a  new  outlook.  This  was  marked  by 
the  work  of  P.  Swerling  In  1958  and  1959  In  connection  with  satellite  tracking, 
and  the  work  of  R.  Kalman  using  state  space  approach.  Kalman's  work  [1]  had  the 
Impact  of  greatly  popularizing  and  spreading  the  estimation  theory  In  different 
fields  of  applications.  Also,  works  by  Stratonovich  and  Kushner  are  among  the 
recent  developments  of  the  subject. 
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From  the  control  theory  point  of  view,  the  problem  of  estimating 
the  state  dynamical  systems  plays  an  Important  role.  Very  often  the 
optimal  control  law  sought  for  a  dynamical  system  Is  some  sort  of  a 
feedback  of  its  state.  Take  for  example  the  control  of  a  chemical  process, 
a  nuclear  reactor,  maneuvering  of  a  space  craft,  guidance  and  navigation 
problems,  and  the  problem  of  control  and  suppression  of  structural  vibra¬ 
tions.  Also,  sometimes.  It  Is  of  Interest  to  know  the  state  of  a  dynamic 
system.  Take  for  example  the  tracking  of  moving  objects  like  satellites 
In  orbits,  and  enemy  missiles.  These  are  just  a  few  examples  of  the  appli¬ 
cation  of  this  knowledge. 

Fundamentally,  the  conditional  probability  density  of  the  state 
conditioned  on  available  observations  holds  the  key  for  all  kinds  of  state 
estimators.  The  case  of  the  linear  dynamical  system,  with  measurements 
linear  In  the  state  variables,  In  the  presence  of  additive  Gaussian  noise, 
and-under  the  assumption  of  full  knowledge  of  the  system  parameters  and  noise 
statistics,  has  been  optimally  solved.  In  that  particular  case,  the  condi¬ 
tional  probability  density  is  Gaussian.  A  Gaussian  density  Is  characterized 
by  only  two  quantities,  namely,  its  mean  and  covariance.  Therefore,  the 
optimal  linear  filter  has  a  finite  state,  the  conditional  mean  and  the  condi¬ 
tional  covariance,  and  Is  widely  known  as  the  Kalman  or  the  Kolman-Bucy  filter 
El]*  [2],  [3],  and  [4].  The  Kalman  filter  provides  the  minimum  variance 
unbiased  estimates.  Also,  the  filter  structures  Is  linear,  its  gain  and 
covariance  can  be  processed  Independently  of  the  estimate  even  before  receiving 
the  observations.  These  features  make  the  Kalman  filter  desirable  and  easy 
to  Implement. 


2 


Unlike  the  linear  case,  the  situation  for  no  ‘linear  systeus  Is 
completely  different.  The  conditional  probability  tensity  Is  no  longer 
Qausslan  even  though  the  acting  noise  Is  Itself  Gaussian.  In  this  case 
the  evolution  of  the  conditional  probability  density  1s  governed  by  a  stochastic 
Integral -partial  differential  equation,  Kushner's  equation,  or  equivalently 
by  an  Infinite  set  of  stochastic  differential  equations  for  the  moments  of  the 
density  function  [3],  [42],  and  [43].  Therefore,  the  truly  optimal  nonlinear 
filter  Is  of  Infinite  dimensionality,  and  consequently  Is  of  no  practical 
Interest.  Therefore,  practical  suboptlmal  finite  dimensional  filters  are  very 
much  needed. 

Inspired  by  Kalman's  results,  a  great  deal  of  research  effort  has  been 
directed  towards  extending  the  linear  results  and  developing  practical  schemes 
for  nonlinear  filters.  Developments  have  relied  on  two  main  approaches. 

The  first  approach  Is  based  on  the  linearization  of  system  nonl Inearltles 
around  a  nominal  trajectory  using  Taylor's  ser1  -s  expansion.  Performing  the 
expansion  up  to  the  first  order  terms  results  in  the  linearized  filter  [3],  and 
[11];  The  approach  can  further  be  Improved  by  linearizing,  again  up  to  a  first 
order,  about  the  most  recent  estimate.  Relinearization  Is  performed  as  more 
recent  estimates  become  available.  By  so  doing  the  well  known  extended  Kalman 
filter  (EKF),  [3],  Is  obtained.  The  Taylor's  series  expansion  can  be  carried 
lip  to  the  second  order  terms.  In  this  case,  with  some  assumptions  on  the  con¬ 
ditional  probability  density  function,  second  order  filters  are  obtained. 

Among  these  are  the  truncated  second  order  filter,  the  Gaussian  second  order 
filter,  and  the  modified  second  order  filter  (M2-F).  These  second  order  filters 
are  presented  In  [3],  and  [11], 
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In  the  second  approach  the  conditional  probability  density  function  Is 
approximated  using  several  techniques.  The  Gaussian  sum  approximation  Is 
used -In  [33],  and  [34].  In  this  case  the  conditional  probability  density  is 
approximated  by  a  finite  weighted  sum  of  Gaussian  densities  with  different 
means  and  covariances.  Since  the  Kalman  filter  Is  a  Gaussian  density  synthe¬ 
sizer,  then  the  resulting  Gaussian  sun  filter  Is  actually  a  bank  of  Kalman 
filters  working  In  parallel.  Each  one  Is  properly  tuned  In  terms  of  system 
parameters  and  Its  output  Is  properly  weighted  and  summed  to  other  filters' 
outputs  to  produce  the  state  estimate.  The  approach  has  been  used  extensively 

by  many  authors  to  treat  the  estimation  problai  of  linear  systems  with  unknown 

'  1 

parameters  e.g.  [35],  [36],  [37],  [38],  [39],  and  [40].  OrthoigSnal  series 
expansion  Is  also  used  to  approximate  the  conditional  probability  density  as 
In  [41],  Also,  the  Idea  of  generating  a  finite  set  of  moments  to  replace  the 
Infinite  set  for  the  true  density  has  been  Investigated  In  [44].  A  more 
detailed  account  and  discussion  of  the  above  mentioned  techniques  Is  given  by 
the  author  In  [61]. 

With  all  the  above  mentioned  approaches  for  developing  suboptimal  finite 
dimensional  filters,  still  the  task  of  theoretical  assessment  of  such  filters 
In  the  sense  of  providing  a  measure  of  how  far  a  suboptimal  filter  Is  from 
being  a  truly  optimal  has  remained  very  hard  to  achieve. 

It  Inherits  the  very  same  practical  difficult-  of  the  optimal  filter  - 
Infinite  dimensionality  -  that  one  Is  trying  to  avoid.  Therefore,  the  support 
of  any  such  schemes  has  to  rely  heavily  on  computer  simulation  and  for  that 
same  reason  not  a  single  scheme  can  be  claimed  always  superior.  For  example,  1 
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[11 ]i  the  truncated  second  filter,  the  Gaussian  second  order  filter,  the 
modified  second  order  filter  (M2-F),  the  extended  Kalman  filter  (EKF), 
and  the  linearized  filter  were  considered  In  numerical  simulation.  The 
linearized  filter  had  the  poorest  performance  but  no  conclusion  was  evident 
about  which  one  of  the  other  filters  Is  superior.  The  EKF  was  favored  for 
Its  relative  structure  simplicity  In  comparison  to  the  other  filters.  There¬ 
fore,  the  final  Judgement  Is  left  to  experience  and  the  special  case  at  hand. 
Consequently,  the  development  of  new  pv*act1cal  filters  will  add  to  the  1  ist 
of  contributors. 

The  main  theme  of  this  chapter  Is  to  consider  the  nonlinear  filtering 
problem  from  a  different  approach.  The  approach  taken  here  is  to  consider 
tho  problem  as  the  combination  of  approximating  the  system  description  and 
solvlnv^  the  filtering  problem  for  the  approximate  model.  As  a  result  some 
new  schemes  are  developed.  The  problem  formulation  and  the  proposed  solution  are 
given  next  followed  by  some  numerical  results. 
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n.  Problem  Formulation: 

Consider  the  general  nonlinear  clynamical  system  whose  state  x(t) 
evolves  in  time  according  to  the  following  differential  equation, 

dx(t)  «  [A(t)  x(t)  +  f(x(t),t)]  dt  +  Q^t)  dW(t)  (1) 

x(to)  -  ,  t  >  tQ 

where 

x(t}  G  r''  Is  an  'n'  dimensional  state  vector. 

A(t}  Is  an  'nxn'  real  matrix. 

f(x(t},t)  Is  an  'n*  dimensional  vector  valued  real  function. 

Xq  c  Is  an  *n'  dimensional  Gaussian  random  vector  (GRV)  with 
E  IXjj}  -  Xp  *  (2) 

r 

and 

Cov(Xq,Xq)  a  E  {(Xp  -  Xp)(x^  -  x^)'}  ■  (3) 

W(t)  c  r”  Is  an  'n'  dimensional  Wiener  process,  and 
dW(t)  ■  W(t+dt)  -  W(t).  Therefore, 

E  {dW(t)}  -  0  for  all  t  >  t^  (4) 

and 

Cov(dW(t),dW(t))  a  E  {dW(t)  dW'(t))  •  (Idt)  (5) 

Where  I  Is  the  (nxn}  unit  matrix. 

Q**(t)  Is  a  real  matrix,  and 

Q(t)  ■  Q**(t)  (t)  Is  a  positive  semidefinite  (nxn) 

matrix. 

*  £{•)  denotes  the  expected  value  of  (•) 

+  Cov(*,*)  denotes  the  covariance  of  (•). 


(6) 


Also*  consider  the  observations  process  dy(t}  to  be  given  by 
dy(t)  ■  [C(t)  x(t)  +  h(x(t).t)]  dt  ♦  R**(t)  dv(t) 

where 

dy(t)  e  R*"  Is  an  'n'  dimensional  observations  vector. 

C(t}  Is  an  'mxn*  real  matrix. 
h(x(t),t)  Is  an  'm*  dimensional  vector  valued  real  function. 
v(t)  e  R"*  Is  an  'm'  dimensional  Wiener  process,  and 
dv(t)  -  V(t  +  dt)  -  V(t).  Therefore, 

E  {dv(t)}  ■  0  for  all  ta-t^  (7) 

and 

Cov(dv(t),  dv(t))  ^  E  (dv(t)  dv'{t)  )  »  (Idt)  (8) 

R’^(t}  Is  a  real  matrix,  and 

R{t)  •  R^^lt)  R**  (t)  Is  a  positive  definite  (nxn)  matrix 

We  assume  that  x^,  w(t),  and  v(t}  are  all  independent  of  each  other 
for  all  values  of  tet^.  Also,  the  assumption  that  equation  (1)  satisfies 

the  conditions  for  existence  and  uniqueness  of  solution  given  In 
[3],  [23],  and  [57]  1s  being  made.  This  means  that  our 
dynamical  system  (1)  admits  only  one  solution  x(t),t>tp  to  be  its  state 
trajectory  In  the  mean  square  sense.  Furthermore,  It  Is  assumed  that 
both  f(x(t},t)  and  h(x(t},t)  are  continuous  in  x(t}. 

As  It  Is  noticed  from  equations  (1),  and  (6),  the  system  structure 
Is  considered  to  be  composed  of  two  parts,  a  linear  part  plus  a  non¬ 
linear  part.  Furthermore,  we  assume  that  the  system  behavior  Is  dominated 
by  Its  linear  part,  That  Is  to  say, 

Ilf(x(t).t)|l  <  ||A(t)x(t)|| 


(9) 


and 

||h(x(t),t)||<  ||C(t)x(t)||  (10) 

where 

||z||  Is  the  norm  of  the  vector  z. 

Equations  (1)  and  (6)  along  with  conditions  (9)  and  (10)  can  be  the 
original  system's  description,  what  is  sometimes  referred  to  as  system 
with  conebounded  nonllnearltles.  Also,  It  can  be  a  representation 
obtained  by  linearization  of  a  nonlinear  system,  where  f(x(t),t} 
and  h(x(t},t)  represent  second  and  higher  order  terms.  In  this  case 
conditions  (9)  and  (10)  are  valid  as  long  as  the  system  state  x(t} 
remains  within  a  small  neighborhood  of  the  nominal  (linearizing) 
trajectory. 

Accordingly,  conditions  (9),  and  (10)  suggest  that  for  a  good  guess  of 
the  system  state  x*(t)  the  following  approximate  equations  for  the 
dynamics  and  observations  can  be  written  as 

dx^(t)  -[A(t)  x^(t)  4  f(x*(t),t)  ]  dt  +  Q‘*(t)  dw(t)  (11) 

ily(t)  .  [C(t)  »,(t)  ♦  h(x*(t),t)  ]  dt  +  l>\t)  dv(t)  02) 

By  virtue  of  continuity  of  the  nonllnearltles  In  x(t),  we  should  note 
’  the  following.  As  x*(t)  approaches  x^(t),  the  approximate  description 
given  In  (11),  and  (12)  approaches  the  true  description  In  (1),  and  (6). 
In  fact,  the  following  equation  _  . 

dx^(t)  «[A(t)x^(t)  4  f(x^(t),t)]  dt  4  Q‘*(t)  dw(t), 

*(‘o'  '  *o* 
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^r>d  eoiatlon  (1)  have  the  same  solution  both  in  the  mean  square  sense 
and  with  probability  one. 

Thus  follows,  the  filtering  problem  of  thi  system  (1),  (6)  can  be 
considered  as  a  unification  of  model  approximation  and  state  estirtlation 
of  the  approximate  model.  In  other  words,  first  we  approximate  the 
system  description  by  finding  a  suitable  x*(t).  Then,  solve  the  optimal 
filtering  problem  of  the  approximate  model.  The  optimal  filtering  is 
basically  to  seek  the  minimum  mean  square  error  estimate  of  the  state 
x(t)  based  on  the  available  observations,  Y^»[y(s),  t^^ssit]. 


Generally,  according  to  theorem  (6.6)  of  [3]  pp.  184 
and  its  specialization  to  linear  systems,  theorem  (7.3)  pp.  219  of  the 
same  reference,  the  optimal  filter  Imitates  the  dynamics  of  the  system 
and  is  linearly  driven  by  the  net  observations.  Therefore,  guided  by 
these  results,  we  will  seek  the  optimal  filter  for  the  system  in  (11)  and 
(12)  as  a  linear  dynamic  system  driven  linearly  by  the  net  observations. 
The  optimality  of  the  filter  Is  in  the  sense  of  achieving  minimum  mean 
square  error. 

so,  if  we  define  the  estimation  error  e^(t)  as 

®l(t)^x^.(t)  -  x^(t)  (-14) 

and  the  covariance  matrix  P(t)  as 

P(t)4  E((e,(t)  -  e(t))(e,(t)  -  e,(t))')  (15) 

.  Where  x,(t)  is  an  estimate  of  x^(t)  based  on  Y^,  and 

¥^(t)  ^  e  {e,(t))  (16) 

then, 


is 


a(e,(t))  -  tr  (E(e^(t)e',(t)}) 

■tr(P(t))  +  tr(e,(t)  e{(t)) 
to  be  minimized.  ' 


(17) 
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III.  Proposed  Solutions 


According  to  the  approximate  model  in  equations  (11)  and  (12), 
the  minimum  variance  unbiased  estimate  (t)  is  given  by  a  Kalman  filter 
which  has  the  following  expression 

(t)  -  [A  (t)x^(t)  +  f(x*(t),t)3  dt  +  K(t)[dy(t)  ' 

-C(t)^^(t)dt  -  h(x*(t),  t)dt] 

"l^V-^o 

K(t)  •  P(t)C'(t)R"^t) 

dP(t)  ■  [A(t)P(t)  +  P(t)  A'{t)  -  P(t)C'(t)R"''(t)C(t)P(t) 

+  Q(t)]  dt 


A  well  known  property  of  the  Kalman  filter  is  that  x^(t)  is  the 
conditional  expectation  of  x^(t)  given  the  measurements  Y^,  i.e 

x^  (t)  »  (t)} 


“  ''**1 
1.  fji  .">■  V.Vi’.  «. » .y.  ."..I 


According  to  the  argument  following  equations  (11)  and  (12),  x  (t) 
is  required  to  provide  the  optimal  solution  of  the  following  minimization 
problem. 

min  J(x  (t))  -  Ey  |(x,(t)  -  x*(t))'(x^(t)  -  x*(t))|  05) 

*  ^ 

X  (t) 

then  for  every  t  S  t^  setk  ng  3J(x  (t))/3x  (t)  ■  o  we  get 

x*(t).E^  {x,(t)|  -  *,(t)  (20) 

Therefore,  combining  the  results  of  equations  (i8)i  ®nd  (go)  we  get 
the  following  filter,  to  be  denoted  as  the  (El-F)  filter, 

namely, 

dx  (t)  -[A(t)i  (t)  +f(x  (t),t)]dt  +  K(t)  [dy(t) 

-C(t)  X  (t)  dt  -h(x  (t),t)  dt]  ,  X  (tp)  ■  Xp  (21 ) 

K(t)  -  P(t)C'(t)R'’'’(t)  (22) 

dP(t)  -  j^A(t)P(t)  +  P(t)A'(t)  -  P(t)C'{t)R’'’(t)C(t)P(t)  +Q(t)j  dt 

P(t^)  .  P^  (23) 

It  is  straightforward  to  recognize  that  in  case  of  a  linear  system, 
i.e.  f{x(t),t)  and  h(x(t).t)  are  identically  zero  or  only  functions  of 
time,  equations  (gl)*  (22)  and  (23)  reduce  to  the  well  known  Kalman 
filter. 


n 


The  extended  Kalman  filter  (EKF),  [3] 

(1)  and  (6)  la  given  by  the  following  equations. 

t 

d^(t)  ■  [A(t)^(t)+f(x(t),t)]dt  +  K(t)  [dy(t)  - 

C(t)  X  (t)  dt  -  h(x(t),t)dt].  -  x^  (24  ) 

K(t)  -  P(t)  [C{t)  +  h^(x(t).t)]V‘’(t)  (25) 

dP(t)  -  {[A(t)  >  f^(x(t),t)]  P(t)  +  P(t)  [A(t)  +  fjj(x(t),t)  3^ 
-P(t)(C(t)  4  h^(x(t).t))‘  R“^t)(C(t)  4  h^(1^(t),t)). 

P(t)  4  Q(t)}  dt.  P(tQ)  -  P^  (26) 


where 


f,  (*(t).  t) 


»f(x(t).t) 

8*(t) 


«(t) 


x(t) 


J(t) 


and 


The  (E1-F)  bears  a  close  relationship  with  the  extended  Kalman 
filter  (EKF).  The  equations  for  the  state  estimate  of  both  the 
(El-F)  and  the  (EKF),  equations.  (21 )  and  (24).  have  the  same  structure. 
While  the  equations  for  the  gain  and  covariance  of  the  (E1-F)«  equa> 
tions  (22)  and  (23),  are  different  from  those  for  the  (EKF),  equations 
(25)  and  (26).  Equations  (22)  and  (23)  are  no  longer  state  estimate 
dependent.  Thus,  unlike  the  (EKF),  the  gain  and  covariance  for  the 
(El-F)  can  be  processed  off  line  and  prior  to  receiving  the  observations 
like  the  Kalman  filter  (KF).  Therefore,  the  El-F  will  be  of  advantage 
over  the  EKF  when  on  line  computations  of  the  gain  and  covariance  are 
not  affordable  due  to  capacity  limitations  of  on  line  computers.  This 
Is  usually  the  case  of  alrborn  and  spaceborn  computers. 

Furthermore,  while  the  (EKF)  has  to  be  strictly  Interpreted  In  the 
I tJ sense,  [62],  It  Is  not  the  case  with  the  (El-F).  This  Is  so  because 

the  gain  K(t)  as  given  by  equation  (22)  Is  not  estimate  dependent. 


B.  Numerical  Experiment  for  (El-F) 


The  Van  der  Pol  oscillator  Is  chosen  to  compare  the  following 

filters,  (El-F),  (KF),  and  (EKF).  The  Van  der  Pol  oscillator  is  charac¬ 
terized  by  the  following  differential  equation,  [24]. 


x(t)  -  cx(t)(l  -  x^(t))  +  x(t)  «  0  (27) 

which  describes  a  dynamical  system  with  state  dependent  damping  coefficient 
equals  -  c(l-x  (t))  where  e  is  a  positive  parameter.  The  damping  in 
the  system  goes  from  negative  to  zero  to  positive  values  as  the  value 
of  x‘(t)  changes  from  less  than  to  greater  than  unity.  The  oscillator's 
response  is  characterized  by  a  limit  cycle  in  the  x(t),  x(t)  plane  (the 
phase  plane).  The  limit  cycle  approaches  a  circular  shape  as  c  becomes 
very  small,  it  has  a  maximum  value  for  x(t)  equals  2.0  irrespective  of 
the  value  of  c.  This  type  of  oscillations  occur  in  electronic  tubes 
which  exhibit  also  what  is  known  as  thermal  noise.  Denoting  x(t)  as 
x.|(t),  and  x(t)  as  X2(t),  equation  (23)  can  be  rewritten  in  a  state 
space  formulation.  Also,  considering  tho  existence  of  some  noise 
forcing  on  the  system,  we  get  the  following  representation  for  the 
Van  der  Pol  oscillator. 


<i(t)  ^  fo  ii  rx^(t)i 

<2(t)J  [-1  cj  [«2{t)J 


-cx^(t)  X2(t) 


Also  suppose  that  the  following  measurement  is  taken 

dy(t)  «  [x^(t)  +  xj(t)]  dt  +  R**  dv(t)  (29) 

In  (24)  and  (25)  above  [w^(t)  WgCt)]^  is  considered  to  be  a  two 
dimensional  Wiener  process.  Also,  V(t)  is  a  one  dimensional  Wiener 
process.  R  is  a  positive  nontero  real  value,  and  Q  Is  a  (2x2)  matrix. 
The  following  values  for  noise  statistics  are  considered. 


Case  ^ 

9ll 

a 

^22 

Dl 

figures 

Van  der  Pol  1 

0.5 

0.0 

IB 

JBjM 

1  to  2 

Van  der  Pol  2 

B| 

2.0 

10.0 

3  to  4 

Also  c  is  taken  to  be  0.2 

In  the  figures,  the  following  symbols  are  used. 

XI  s  the  i—  state ,  I  ■  l ,  2 

XI K  2  the  estimate  of  the  i'^  state  provided  by  the  (K-F) 

XIE  E  the  estimate  of  the  1—  state  provided  by  the  (El-F) 

XIEK  s  the  estimate  of  the  state  provided  by  the  (EKF) 

In  both  cases;  as  indicated  by  figures  1,  2,  3,  and  4,  both  the  (El-F) 

and  (EKF)  provide  very  accurate  tracking ^of  the  system's  states  while 
the  (KF)  provides  crude  estimates. 


<  i.i.<  m^JIj  A.'j 


15 


FIGURES  CAPTIONS 


Flgure'1.  First  state  and  estimate  by  Kalman,  El,  Extended 
Kalman  filters. 


Figure  2.  Second  state  and  estimates  by  Kalman,  El,  Extended 
Kalman  filters. 


Figure  3.  First  state  and  estimates  by  Kalman,  E1 ,  Extended 
Kalman  fitlers.  . 


Figure  4,  Second  state  and  estimates  by  Kalman,  El,  Extended 
Kalman  filters. 


_ Derivation  of  the  KE-fand  the  E2N-F  filters 


We  have 


dx,(t)  -  [A{t)x^(t)  +  f(x*(t),t)]  dt  +  Q^t)  dw(t)  (30) 

dy(t)  «  [c{t)x^{t)  +  h(x*{t)*t)]  dt  +  dv(t)  (31  ) 

as  Dur  approximate  model  for  son®  given  good  guess  of  the  system  state 
x*{t). 


Then  we  seek  a  filter  which  Is  a  linear  dynamic  system,  linearly 
driven  by  the  available  observations  as  follows 


dxi(t)  •  [B(t)x^(t)]  dt  4  K(t)  dy(t) 


(32) 


where 

B(t)  Is  an  'nxn'  matrix  and  K(t)  Is  an  'nxm',  the  filter's 
gain  matrix. 

In  order  to  evaluate  the  accuracy  of  this  filter  In  estimating  the 
state  x^(t),  we  define  the  estimation  error  e^(t)  as 
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Therefore  from  feo  ),  (31),  end  (32)  we  get 

de^(t)  »  [(A(t)  -  K(t)  C(t)  -  B(t))  x,(t)  +  B(t)  e^(t) 

+  f(x*(t),t)  -  K(t)  h(x*(t),t)]  dt 
+  Q‘^(t)  dw(t)  -  K(t)  R^t)  dv(t). 
e,(to)  -  Xq  -  ^^^(to)  fc4) 

It  is  desirable  to  have  the  estimation  error  independent  of  the 
state.  In  this  case  large  state  variables  can  be  estimated  as  accurate 
as  small  state  variables. 

Therefore,  we  may  choose 
6(t)  ■  A(t)  -  K(t)  C(t). 

Hence,  the  dependence  of  the  estimation  error  on  the  state  is  eliminated. 

Also,  the  initial  minimum  variance  estimate  is  the  mean  of  the  initial 

state  . 

0 


•  Therefore, 


Hence,  equation  (34)  reduces  to 


de,(t)  -  [(A(t)  -  K{t)  C(t))  e^(t)  +  f(x*{t),t) 

-  K(t)  H(x*(t),t)]  dt  +  Q’‘(t)  dw(t) 

-  K(t)  R^^tt)  dv(t). 

Accordingly,  the  equation  for  the  mean  value  of  the  error  e^(t)  is 
as  follows. 

de,(t)  -  [(A(t)  -  K(t)  C(t))  i^tt)  +  f(x*(t).t) 

-  K(t)  h(x*(t),t)]  dt.  =  0 

It  is  clear  that  equation  (  38}  above,  due  to  the  term 
[f(x*(t),t)  -  K{t)  h{x*(t),t)],  will  have  a  non  zero  solution,  i.e. 

e^(t)  5  E{e^(t)}  i  0 

Hence  our  estimate  is  biased  unless  the  term  [f(x*(t),t)  - 
K(t)  h(x'*'(t),t)]  is  identically  equal  to  zero  for  all  values  of 
t  a 

From  equations  (37)  and  (38)  above,  we  have 

de,(t)  -  deY(t)  •  [A(t)  -  K(t)  C(t)](e^(t)  -  e,(t))  dt 
+  Q^t)  dw(t)  -  K(t)  R*=(t)  dv(t). 


By  definition  the  covariance  matrix  P(t)  is 


P(t)  -  E{(e.,(t)  -  i,(t))  (e,(t)  -  i,(t))')  (41  ) 

Therefore,  straight  forward  mathematical  manipulations  show  that  P{t) 
is  given  by  the  following  differential  equation. 

dP(t)  -  [(A(t)  -  K(t)  C(t))  P(t)  +  P(t)  (A(t)  -  K(t)  C(t))- 

+  Q(t)  +  K(t)  R{t)  K"(t)]  dt,  P(tjj)  «  Pq  (»2  ) 

Next,  we  seek  the  gain  K(s),  £  s  st  that  will  provide  the  minimum 

mean  square  error.  Therefore  we  formulate  the  following  optimization 
problem 

min  tr(P(t))  [f(x*{s),5)  -  !C(s)  h(x*(s),s)]  [f(x*(s),s) 
K(s)  •'to 

t^ssst 

-  K(s)  h(x*(s),s)]  ds  (13 ) 

Subject  to  the  constraint  given  by  (42). 

This  can  be  rewritten  as  the  following  minimization  problem, 

min  f  tr(rA(s)  -  K(s)  C(s)l  P(s)  +  P(s)  rA(s)  -  K(s)  C(s)]' 
K(s)\ 

tj,£S5t 

+  K(s)  R{s)  K'(s)  +  [f(x*(s),s)  -  K(s)  ti(x*(s)  ,s)J. 
[f(x*(s),s)  -  K(s)  h(x*(s),s)] ')  ds  (44) 


The  integrand  in  (44)  is  a  convex  quadratic  in  K(t).  According  to 
the  theory  of  calculus  of  variations,  119],  the  minimizing  K(s),  t 
given  as  the  solution  of  the  Euler's  equation  which  reduces  to  a 
simple  algebraic  equation  in  the  present  case,  namely 

3]^‘tr([A(t)  -  K(t)C{t)3Plt)  +  P(t)[A(t)  -  K{t)C(t)]' 

+  K(t)R(t)r{t)  +  [f(x*(t),t)  -  K(t)h(x*(t),t)]  [f(x*(t),t) 

« 

-  lC(t)h(x*{t),t)]0  -  0 

Using  the  concept  of  gradient  matrices  and  the  formulae  developed 
[52],  we  get 


tr(K(t)C(t)P(t))  -  P(t)C'(t) 
37^  tr{P(t)C'(t)K*(t))  *  P(t)C'(t) 
^|^tr(K(t)R(t)r(t))  -  2K{t)R(t) 

And, 


3^tr  ([f(x*(t).t)  -  K(t)h(x*(t),t)]  [f(v*(t),t) 
-  K(t)h(x*(t),t)]‘)  ^  -  2f(x*(t),t)h'(x*(t),t) 

+  2K(t)h(x*(t),t)  h*(x'»(t).t) 


s  t  is 

(45) 

n 

(46) 

(47) 

(48) 

(49) 
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Substituting  (46),  (47),  (48 )„  and  (49)  in  (45)  above,  the  opti.nial 
gain  is  found  to  satisfy  the  following  equation. 


K(t)  [R(t)  +  h(x*(t).t)  h'(x*{t}.t)]  -  P(t)C"(t) 

+  f(x*(t).t)h'(x*(t).t) 

(5o: 

Therefore,  the  solution  to  the  filtering  problem  of  the  approximate 
model  is  given  by 

« 

dH^(t)  *  A(t)  x^(t)  dt  +  K(t)  [dy(t)  -  C{t)  X^(t)  dt] 

*1  •  *0 

K(t)  •  [P(t)C-(t)  +  f{x*(t),t)  h'(x*(t).t)3  •  •  (51  ) 

[R(t)  +  h(x*(t).t)  h'(x*(t).t)]“’ 

dP(t)  «  [(A(t)  -  K(t)  C(t))  P(t)  +  P(t)  (A(t)  -  K(t)  C(t))' 

+  Q(t)  +  K(t)  R(t)  K'(t)]  dt.  P(t^)  - 

It  is  clear  that  the  inverse  in  the  gain  equation  (50)  exists  because 
R(t)  is  a  positive  definite  matrix  and  h(x*(t),t)  h'(x*(t),t)  is  always 
a  positive  semidefinite  matrix. 

Although  the  bias  term  [f(x*(t),t)  -  K(t)  h(x*(t),t)]  has  been  mini¬ 
mized,  by  choosing  the  gain  K(t)  according  to  ^0),  it  is  not 


identicallj'  zero.  The  bias  can  be  eliminated  by  modifying  the  state 
estiinate  equation  such  that  the  filter  will  be  as  follows. 


■ 

dl(^(t)  -  [A{t)  J^,(t)  +  f(x*(t).t)]  dt 

+  K(t)  [dy(t)  -  (C(t)  x^(t)  +  h(x*(t).t))  dt] 


K(t)  «  [P(t)  C'(t)  +  f(x*{t),t)  h'(x*(t),t)]  • 
[R(t)  +  h(x*(t),t)  h'(x*(t),t)]‘^ 

dP(t)  «  [(A(t)  -  K(t)  C(t))  P(,t) 

+  P(t)  {A(t)  -  K{t)  C(t)  ) 

+  Q(t)  +  K(t)  R{t)  K'(t)]  dt,  P{tp)  • 


r  (52) 


* 

Ne.'^t,  the  guessed  nominal  trajectory  x.  (t)  is  to  be  updated 
optimally  in  a  sense  to  drive  it  as  dose  as  possible  to  x<|(t).  Hence, 
the  following  minimization  problem  is  formulated. 


xl(j)  d(x*(t))  -F  Ey  {(x,(t)  -  x*(t))'(x.^(t)  -  x*(t))}  (53) 

Then  for  every  t  a  t^  setting  DJ(x*(t))/ax*(t)  •  0  we  get 
X*(t)  «  Ey^  {x^(t))  »  x^(t) 
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' 


(54) 


Now,  by  combining  the  results  in  (5’  )  and  {  54)  we  obtain  the  (E2-F) 
filter  as  follows. 


dx^(t)  »  A(t)  5^(t)  dt  +  »((t)  [dy(t)  -  C(t)  x^(t)]dt 

Xi(to)»Xo 

K(t)  -  [P(t)  C-{t)  +  f(x,{t).t)  hix^(t),t)] 

[R(t)  +  h(x,(t).t)  h'(x^(t).t)]‘’  I 

dP(t)  -  [(A{t)  -  K(t)  C(t))  P(t)  +  P(t)  (A(t)  -  K(t)  C{t))- 
+  K(t)  R(t)  K'(t)  +  Qlt)]  dt 

*  “  Pc 

And,  by  combining  the  results  in  (52)  and  {  54)  we  obtain  the  (E2N-F) 
filter  as  follows 
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---  ^  ..  M— .1--T  k..T  ’1l 


I 


dx^(t)  »  [A(t)  +fa,(t).t)]  dt 

+  K(t)  [dy(t)  -  (C{t) 

4  h(i(^(t),t))  dt] 


(56) 


K(t)  =  [P(t)  C'Ct)  +  f(x,(t).t)  h-(^J,(t),t)]  • 

[R(t)  4  h($T(t),t)  h'(^,{t),t)]"'' 

\ 

dP(t)  »  [(A(V)  -  K(t)  C(t))  P(t)  4  p(t)  (A(t)  -  K(t)  C(t))' 
4  K(t\  R(t)  K'(t)  4  Q(t)]  dt 


P(to)  «  Pq 


Few  points  should  be  inentioned  In  ccrnimentlng  on  the  results  given  by  the 
equations  In  (55)  and  (56)..  It  Isaeasy  to  recognize  that  both  the 
(E2-F)  and  the  (EZf^F)  will  rediice''ij)  the  standard  Kalman  filter  (KF) 
when  there  Is  no  nonllnearltjles  ^n^^tjie  system  structure.  The  (E2-F) 
has  a  linear  structure  for  the  stat^  estimate  equation.  But,  the 

I 

gain  matrix  K(t)  and  the  covariance  ihatrlx  P(t)  for  both  filters  In 
(55)  and  (56)  are  state  estimate  dependent,  a  common  feature  In  many 
of  the  suboptimal  nonlinear  filters.  The  results  Indicate  that  the 
measurement  nonl Inearl tle^  have  an  effect  on  the  filter  gain  similar 
to  adding  to  the  measurement  noise  by  Increasing  Its  covariance.  On 
the  other  hand  both  the  dynamics  and  the  measurements  nonllnearltles 
have  a  combined  effect  similar  to  P(t)  C'(t).  If  there  Is  no 
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measurements  nonllnearltles  (h(x(t),t}  =  0)  then  the  (E2-F)  will  reduce 
to  the  standard  Kalman  filter  (KF)  without  compensating  for  the  dynamics 
nonllnearltles,  while  (E2N-F)  will  reduce  to  the  (El-F)  given  by  equations 
(21)(22}  and(23} 

D.  NUMERICAL  EXPERIMENTS  FOR  (E2-F).  and  E2N-F); 

As  before  the  Van  der  Pol  Oscillator  Is  chosen  to  compare  the  following 
filters.  (E2-F),  and  (KF)  In  one  experiment;  and  (E2N-F),  (EKF),  and  (M2-F) 
1n  the  second  experiment. 

The  following  values  for  noise  statistics  are  considered. 


Case  if 

On 

0l2 

P22 

R 

Figures 

Van  der  Pol  1 

0.5 

0.0 

0.5 

4.0 

5  through  8 

Van  der  Pol  2 

5.0 

0.0 

5.0 

10.0 

9  through  12 

Van  der  Pol  3 

10.0 

0.0 

10.0 

20.0 

13  through  16 

As  before  c  is  taken  to  be  0.2 
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XI  £  the  1  th  state.  1-1,2 

XIK  =  the  estimate  of  the  1  ^  state  provided  by  the  (k-F) 
XiE  s  “  “  "  •'  "  “  *'  '*  the  (E2-F) 


XIEN  =  " 


XIEK  =  •' 


XIM  £  •• 


II  II  II  II  II 


11  II  II  II  II 


II  II  II  It  II 


II  II  II  II  II 


•'  the  (E2N-F) 
"the  (EKF) 
the  (W2-F) 


'  >  e.T  •4.1  ^ 


Figure  Captions 


Figure  5.  First  state  and  estimates  by  Kalman  and  E2  Filters. 

Figure  6.  Second  state  and  estimates  by  Kalman  and  E2  Filters. 

Figure  7.  First  state  and  estimates  by  Extended  Ka1man»  E2N,  and 
modified  second  order  filters. 

Figure  8.  Second  state  and  estimates  by  Extended  Kalman,  E2N,  and 
modified  second  order  filters. 

Figure  9.  First  state  and  estimates  by  Kalman  and  E2  Filters. 

Figure  1C.  Second  state  and  estimates  by  Kalman  and  E2  Filters. 

Figure  11.  First  state  and  estimates  by  Extended  Kalman,  E2N,  and 
modified  second  order  filters. 

Figure  12.  Second  state  and  estimates  by  Extended  Kalman,  E2N,  and 
modified  second  order  filters. 

Figure  13.  First  state  and  estimates  by  Kalman  and  E2  Filters. 

t 

Figuri  14.  Second  state  and  estimates  by  Kalman  and  E2  Filters. 

Figure  IS.  First  state  and  estimates  by  Extended  Kalman,  E2N,  and 
modified  second  order  filters. 

Figure  16.  Second  state  and  estimates  by  Extended  Kalman,  E2N,  and 
modified  second  order  filters. 


From  the  previously  displayed  sets  of  results  the  following  can  be 
pointed  out.  In  the  first  two  sets  of  results.  Van  der  Pol  1  and  Van  der 
Pol  2  the  {E2-F)  provides  a  better  tracking  of  the  system  states  than  the 
(K-r).  This  is  evident  from  figures  5,  6,  9,  and  10.  It  is  clear  that 
the  (E2N-F)  has  accuracy  similar  to  that  of  the  (EKF)  which  is  better  than 
the  {M2-r)  as  Indicated  by  figures  7,  8,  11,  and  12.  In  the  third  set  of 
results,  Van  der  Pol  3  figures  13  through  16  the  noise  level  is  high  enough 
to  cover  the  effect  of  the  system  nonlinearitles.  Therefore,  all  filters 
except  the  (M2-F)  have  similar  performance.  The  (M2-F)  is  badly  degraded 
and  provides  a  crude  estimate  of  the  system  state. 
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IV.  CONCLUSIONS; 

#hc  nonlinear  filtering  problem, Is  treated  using  a  new  approach. 

The  approach  consists  of  unifying  a  system  model  approximation  technique 
with  the  filtering  solution  based  on  the  approximate  model . ^As  a  result, several 
filters  are  developed. 


^  The  first  filter  (El-F)  structurally  fits  into  the  gap  between  the  Kalman 
(KF)  and  the  extended  Kalman  {EKF)  filters.  On  one  hand  it  enjoys  the  same 
computational  facility  enjoyed  by  the  Kalman  filter,  namely,  the  off-line 
computations  of  its  gain  matrix.  And  on  the  other  hand  it  provides  state 
estimates  on  the  same  level  or’  accuracy  as  provided  by  the  extended  Kalman 
filter.  Therefore,  in  this  sense  the  (El-F)  provides  a  missing  link  between 
(KF)  and  (EKF), 


i  The  other  two  filters  are  referred  to  as  the  {E2-F)  and  the  (E2N-F).  The 

i 

'  state  estimate  provided  by  the  (E2-Fj  has  a  structure  like 
(KF)  while  that  of  the  (E2N-F)  has  a  structure  like  the  (EKF).  Both  filters 
have  new  formula  for  the  gain  which  provides  further  insight  into  the  effects 
of  the  system  nonlinearities.  Specifically,  measurements  nonlinearities  have 
the  effect  of  increasing  the  measurenents  noise  level.  Moreover,  the  dynamics 
nonlinearities,  and  also  the  measurements  nonlinearities  have  a  combined  effect 
similar  to  the  P(t)C'‘(t)  term  in  the  Kalman  filter. 

In  conclusion,  the  contribution  of  this  chapter  Is  in  providing  three  new 
practically  impl ementa bl e  filters  for  stochastic  dynamic  systems  which  include 
nonlirecrities  in  their  structure. 
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